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EXTRA CTION AND UTILISATION OF VNT R ALLE1..FS 
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Glossary of Terms and Abbreviations 



AFLP 
allele 

10 DNA 

DNA fingerprint 

individual 
heteroduplex 
is heterozygygous 

homoduplex 
homozygous 

20 locus 

mis-match 

NASBA 
PCR 
25 RAPD 
RDA 
RFLP 
trait 



amplified fragment length polymorphism 

one of several possible alternative sequence variations 

at any one locus 

deoxyribonucleic acid 

the display of a set of DNA fragments from a specific 
DNA sample 

a member of any species subject to investigation 

a duplex of two different alleles of any one locus 

alleles at the same locus of each of the paired 

chromosomes in a diploid cell being different 

a duplex of identical alleles of any one locus 

alleles at the same locus of the paired chromosomes of 

a diploid ceil being identical 

a specific position on a chromosome 

one or more bases in a duplex that fail to form stable 

hydrogen bonds with apposing bases 

nucleic acid sequence based amplification 

polymerase chain reaction 

random-amplified DNA markers 

representational difference analysis 

restriction fragment length polymorphism 

a distinguishing feature or characteristic manifesting 

itself physically, chemically or biologically 
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Iotal Representation of Alleles that are Informative for a 
Irait 

variable number tandem repeat, also referred to as 
simple sequence repeats (encompassing all repeats of 
two or more nucleotides that may be continuous or 
interrupted by short non-repetitive sequence, including 
minisatellites and microsatellites). 

Field of the inven^n 

10 The field of this invention is the detection of polymorphic 

variation in complex genomes, which is the mainstay of the study of 
hereditary traits in all organisms. Since polygenic traits far outweigh those 
that are monogenic, a procedure that allows the isolation in concert of 
several informative polymorphisms within the complex genomes of multiple 

15 individuals would provide an extremely powerful tool for the investigation of 
hereditary traits. 

The invention differs fundamentally from all other techniques 
that have been previously employed by: 

(') permitting mass generation of VNTRs quickly and easily from 

20 DNA 

(") generating polymorphisms that are both linked and 

informative for a trait; 

('") reproducing and pFeserving the polymorphic allele, as it 

occurs in the genome; 
25 ( iv ) negating problems that are features of other polymerase 

chain reaction based techniques; including miss-priming, reaction 
contamination and generation of spurious products: 

(v) negating the need for investigations to be confined to families 

of closely-related individuals; 
30 ( vi ) permitting the analysis of polygenic traits; 



TRAIT 
VNTR 
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(vii) having a sparing requirement for DNA starting material. 

The invention therefore represents a major advancement in 
the ability of workers in the biomedical fields to screen simple or complex 
genomes, rapidly and with fidelity, for polymorphisms co-segregating with 
advantageous or deleterious monogenic or polygenic hereditary traits. 
There is enormous potential for advancement of medicine, veterinary 
medicine, forensic science, agriculture, animal husbandry and 
biotechnology, by the generation of polymorphic markers co-segregating 
with hereditary disease or traits of social or economic importance. The 
invention will also serve to facilitate mutation analysis for all relevant 
organisms. 

Introduction 

DNA is a double stranded linear polymer composed of 
repetitions of four mononucleotide units. The sequence in which these 
units are arranged gives rise to a genetic code, referred to as the genome. 
Although the genomes of all individuals within a species are essentially 
homologous, subtle variations exist which impart individuality. Locations of 
the genome at which more than one sequence variation may exist are 
termed polymorphisms, each variant of that sequence representing an 
allele. Polymorphisms in gamete-forming germinal cells will be inherited by 
subsequent generations of progeny. By studying the combination of 
polymorphisms in the genome of an individual a unique code ('fingerprint*) 
can be assigned and the ancestry of that individual can be determined. 
Furthermore, a polymorphism found to be linked and co-segregating with a 
particular genetic trait or hereditary disease may be used as a marker for 
genetic screening of that trait or disease in other individuals. 

The study of advantageous or deleterious hereditary traits in 
complex genomes has been the subject of considerable interest due to its 
economical, medical and social implications. The establishment of 
protocols that allow the comparison of nucleic acid sequences in complex 
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genomes and ,he isolation of differences unique to a subset of those 
sequences is a fundamental requirement of this field of study. 

A number of protocols have been used in animals and plants 
or the comparison o, nucleic acid sequences and isolation of difference 
between those sequences in individuals. These protocols involve 
restriction fragment length polymorphism (RFLP), random-amplified 
poiymorphic DNA markers (RAPD), amplified fragment length 
polymorphism (AFLP), representational difference analysis (RDA) and 
linkage analysis of variable number tandem repeats (VNTR, These 
protocols detect polymorphisms by assaying subsets of the total DNA 
sequence variation in a genome. Polymorphisms detected by RFLP 
AFLP, and RDA rely on the generation of a fingerprint ladder by gel-' 
electrophoresis which reflects restriction fragment size variation RAPD 
polymorphisms result from sequence variation a. primer binding sites and 
differences in length between primer binding sKes [1,. Linkage analysis 
•nvolves the detection of length variation of variable number tandem 
repeats (VNTRs) and co-segregation of one allele with a trai, of interest 



RFLP 
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25 



RFLP analysis relies on the cleavage of a nucleic acid 
sequence by restriction endonucleases and separation of the resulting 
fragments by gel electrophoresis. The fragments are blotted onto a 

IT" T hybr ' diZed '° ' abe " ed PTObeS '° a " W 

length var,a„on [2, 3,. This technique may be of use in the study of a 

s.ng,e isolated locus or gene fragment, bu, where an investigation is not 
confined to an isolated sequence » is inadequate. Further limitations are 
that only a small number of the polymorphisms generated may be 
informative, there is a high demand for DNA starting materia,, and the 
method is labour Intensive [3]. 
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RAPD 

RAPD is a commonly used PCR-based polymorphic marker 
technique in genomic fingerprinting and diversity studies, particularly for 
plant species. This technique involves the use of a single 'arbitrary primer' 
5 which gives rise to amplification of regions of genome where there is 

sufficient homology between the sequences of genomic DNA, in the 5' to 3' 
direction, and that of the arbitrary primer. The amplified products are 
separated by gel electrophoresis [3]. Subtle variations of this method 
include arbitrary primed-PCR (AP-PCR) and DNA amplification 
10 fingerprinting (DAF). However, the principle of arbitrary priming and 
amplification of DNA by PCR for difference analysis is common to all. 
Advantages compared to RFLP are that these methods are more rapid, 
have a lower demand for DNA, and do not require prior knowledge of 
sequence. A limitation in common with RFLP is that each analysis can 
15 only compare the genomes of two individuals. Although several loci can 
be evaluated concomitantly by this method, detection of polymorphisms 
requires observation of variation in band patterns by gel-electrophoresis 
and is subject to errors of superimposition of different alleles of similar 
electrophoretic mobility. Many bands may be faint and difficult to interpret, 
20 and it is difficult to achieve consistent results in repeat experiments. In 

common with the majority of PCR techniques, the results are prone to error 
by subtle changes in reaction conditions, reagent contamination, and the 
generation of inconsistent banding patterns. This lack of reliability limits 
the usefulness of such techniques in the 'typing' of individuals [2]. 

25 

AFLP 

AFLP analysis (EP, A, 0534858; Zabeau M era/.) involves 
restriction endonuclease digestion of DNA and ligation of the generated 
restriction fragments to adaptors. Using primers complementary to the 
30 adaptor sequence, the restriction fragments are amplified by PCR, and the 
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products are separated by gel-electrophoresis, differences in band patterns 
revealing polymorphisms [2]. Microsatellite-AFLP (WO 96/22388; Kuiper M 
et a/.) is a modification of this technique in which two or more restriction 
enzymes, at least one of which cuts at a simple sequence repeat, are used 
5 to cleave DNA into fragments that are ligated to adaptors. The fragments 
are amplified with primers complementary to the adaptor sequence. In 
common with RAPD, several loci can be evaluated concomitantly by this 
method, but detection of polymorphisms requires observation of variation in 
band patterns by gel-electrophoresis and is subject to errors of 
io superimposition of different alleles of similar electrophoretic mobility. The 
ability to score bands on an AFLP fingerprint is compromised by generation 
of large numbers of bands of which some may be very faint and difficult to 
interpret [6J. Furthermore, the technique is prone to errors that are 
common to all PCR based techniques, summarised above, and suffers 
15 from an inability to analyse multiple complex genomes simultaneously. 
This is compounded by the generation of bands, by incomplete restriction 
of the template DNA, that do not reflect true polymorphisms [7]. AFLP and 
RAPD analyses therefore share many of the same limitations. An 
additional problem is that AFLPs, rather than being evenly dispersed 
20 through out the genome, are reported to be clustered around centromeres 
[2]. Consequently, this method may not allow the generation of 
polymorphisms that co-segregate with sequence differences of interest if 
they are located at a distance from centromeres. This problem is reflected 
in the reduced rate of polymorphism detection compared to techniques 
25 such as linkage analysis [1]. Furthermore, the complexity of the 

experimental data derived by AFLP becomes exaggerated with increasing 
complexity of the genome subject to analysis. Consequently, although it 
has been possible to investigate the genomes of some plant species by 
AFLP analysis, the relatively complex genomes of higher eukaryotic 
30 species may be beyond the useful capacity of this technique. 



RDA 

RDA involves restriction endonuclease digestion of DNA, 
ligation of the fragments to adaptors and amplification by PCR. 
Differences between compared genomes are selected by successive 
rounds of subtractive hybridization and kinetic enrichment such that 
regions of difference predominate [5]. This technique is prone to 
erroneous results through reaction contamination and generation of 
spurious products. In addition, a fundamental requirement of RDA is the 
availability of families of closely related individuals, some of which are 
manifesting the trait of interest. Where RDA is performed on anything 
other than closely related or highly inbred genomes the multiplicity of 
differences is too vast for succinct and useful analysis. 

In all of the above protocols it is essential that there is a 
difference in nucleotide sequence at or between primer binding sites or 
endonuclease restriction sites in order to detect polymorphisms. This 
highlights the major limitations of these procedures, because in many 
instances a mutation giving rise to a hereditary trait will not create a 
sequence difference detectable by variation in primer binding or restriction 
enzyme digestion. Consequently, a polymorphism linked to a trait of 
interest will not be identified using these techniques. Furthermore, in 
contrast to VNTR polymorphisms, primer and restriction site 
polymorphisms are not hypervariable, and the majority of polymorphisms 
based on variation in primer binding or restriction enzyme digestion are not 
informative. 

Linkage analysis 

Linkage analysis is an indirect molecular genetic strategy that 
involves the systematic comparison of the inheritance of polymorphic 
VNTRs with the trait of interest in families in which that trait is present. 
There are a number of types of VNTR, including minisatellites and 
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microsatellites, a feature of all of which is the repetition of elements of 
simple sequences. They are polymorphic by virtue of variation in the 
number of times each element is repeated, giving rise to alleles with 
variation in length [2].Since several alternative alleles may exist at any one 
5 locus, in contrast to polymorphisms based on variation in primer binding or 
restriction enzyme digestion, VNTR polymorphic alleles tend to be highly 
informative. Consequently, where co-segregation of a trait with a particular 
VNTR allele is demonstrated, the allele may be used as a marker for that 
trait, or may be used as a vehicle to facilitate identification of the molecular 
10 genetic basis of the trait. Microsatellites are ubiquitously distributed 
throughout all eukaryotic genomes [4]. Consequently, linkage analysis 
with microsatellites is associated with the highest polymorphism detection 
rate of the genetic screening methods [1]. Indeed, systematic 
microsatellite analyses have already been responsible for many advances 

is in the understanding of certain types of common cancer [1]. Linkage 

analysis therefore has advantages compared to other related methods of 
difference analysis, the results of which are very reproducible [2]. 
However, linkage analysis is very time consuming, labour intensive and 
expensive. Furthermore, since many analyses are performed individually 

20 the overall requirement for DNA is extremely high. This is particularly true 
if a physical map of the genome is unavailable for the selection of 
informative microsatellites that are evenly distributed throughout the 
genome. The demonstration of Hnkage requires the application of 
elaborate statistical programs and powerful computer software for analysis 

25 of the experimental data. This technique is better suited to monogenic 
defects since the statistical analyses required for multigenic traits are 
particularly complex. Unfortunately, multifactorial genetic traits are far 
more prevalent than monogenic defects, making linkage analysis a 
cumbersome technique for the investigation of the majority of hereditary 

30 traits [4]. 



The characteristics of an ideal protocol for isolation of 
polymorphisms co-segregating with disease in complex genomes would 
include: 

(') the ability to isolate simultaneously and with fidelity the 

polymorphisms from complex genomes of several individuals 
(") the ability to isolate several polymorphisms simultaneously, 

permitting the analysis of polygenic traits 

("0 a high detection rate of polymorphisms that co-segregate with 

sequence differences in all eukaryotic species, including subtle differences 
such as those resulting from point mutations 

< iv ) no requirement for large families of closely related individuals 

to study traits of interest 

( v ) n o requirement for physical maps of the genome or prior 
knowledge of genomic sequence 

( vi ) a requirement for sparing quantities of nucleic acid samples 
for analysis 

(vii) simplicity of use without a need for expensive specialist 
laboratory equipment or computer software 

(viii) potential for widespread application throughout the animal 
and plant kingdoms 

( ix ) a robust performance with precision, accuracy and fidelity. 
None of the techniques that are currently available fulfil the 

majority of these ideal characteristics. All are compromised by at least one 
of several limitations including: expense; lack of speed; requirement for 
large amounts of DNA; low polymorphism detection rate; an inability to 
detect small sequence variations such as point mutations; a lack of fidelity 
with high incidence of artefacts and spurious results; inability to analyse 
several complex genomes concomitantly; an inability to resolve 
simultaneously polymorphisms at multiple loci; an intrinsic need for closely 
related genomes for analysis; a need for prior knowledge of sequence; and 
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Bepresentation of Alleles Informative for a Irait (TRAIT). 

In one aspect the invention provides a method of making a 

™xture of VNTR alleles and .helping regions of the genomic oZ f 

reZ:: membera of a species °' ^ * ch — 

a) dividing genomic DNA of the species of interest into 

fragments, 

« gating to each end of each fragment an adap ,or having a 

-esse 3'-end which is blocked to prevent enzyme chain 
thereby form.ng a mixture of a d ap tor-.erminated fragments. 



c ) using a portion of the mixture of adaptor-terminated 
fragments as templates with an adaptor primer and a VNTR primer to 
create a mixture of 5-flanking VNTR amplimers, 

d ) using a portion of the mixture of adaptor-terminated 
fragments as templates with an adaptor primer and a VNTR antisense 
primer to create a mixture of 3'-flanking VNTR amplimers, 

e ) and using genomic DNA of the one or more members of the 
species of interest as template with the mixture of 5'-flanking VNTR 
amplimers and the mixture of 3'-flanking VNTR amplimers as primers to 
make the desired mixture of VNTR alleles and their flanking regions. 

The species of interest may be any eukaryotic species from 
the plant and animal kingdoms. Although they do not show repetitive 
sequences in quite the same way, prokaryotic species are also envisaged. 
An individual member of a species may be for example a plant or a micro- 
organism or an animal such as a mammal. 

In another aspect the invention provides a portion of genomic 
DNA of one or more members of a species of interest, said portion 
consisting essentially of a representative mixture of alleles of a chosen 
VNTR sequence and their flanking regions. 

The term "representative mixture of alleles" does not 
necessarily imply that all of the possible alleles, or even most of these 
possible alleles, of a chosen VNTR sequence are present. Whether a 
particular allele is present or not, 'e.g. in the mixture generated by the 
method defined above, may depend on the nature of a restriction enzyme 
used in step a) and on other factors. 

The invention also provides a portion of genomic DNA of a 
species of interest, said portion consisting essentially of a representative 
mixture of 3'-flanking regions of a chosen VNTR sequence, each member 
of the mixture carrying an adaptor at its 3'-end. 

The invention also provides a portion of genomic DNA of a 
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species of interest, said portion consisting essentially of a representative 
m,xture of 5'-flanking regions of a chosen VNTR sequence, each member 
of the mixture carrying an adaptor at its 5'-end. 

The invention also provides a method of treating a mixture of 
po«ymorphic alle.es, e.g. of a chosen VNTR sequence and their flanking 
reg.ons, or alternatively a mixture generated in some other way such as 
AFLP, microsateilite-AFLP or RAPD, the mixture being representative of 
those which manifest a trait of interest, which method comprises separating 
and then re-annealing strands of the mixture, and separating and 
discarding any mis-matches. Preferably the method comprises the 
additional step of hybridizing the said mixture with a mixture of 
corresponding polymorphic alleles, e.g. of the chosen VNTR sequence and 
their flanking regions, or alternatively a mixture generated in some other 
way such as AFLP, microsatel.ite-AFLP or RAPD, which are representative 
of those which do not show the trait of interest, and selecting mis-matches 
to prov.de a mixture of polymorphic alleles which are characteristic of the 
trait of interest. 

The invention also provides kits comprising protocols and 
reagents for performing the methods herein described. 

The salient points of the invention may be represented as 

follows: 

(0 reduction in the complexity of the genome by double positive 

selects of genomic DNA restriction fragments that both ligate to a chosen 
adaptor and contain a sequence with homology to a chosen primer 
employing enrichment of such products by PGR. NASBA or other methods- 
CO introduction of the selected enriched fragments to a genomic ' 

template in such a way that allows recreation of the VNTRs with the 
flanking sequences within that template, whilst preserving the allele and 
therefore the informativeness of each locus; 

(iii) mis-match discrimination of the generated VNTR alleles to 
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remove any spurious products of amplification that occur through miss 
priming events, reaction contamination, and subtle variation in reaction 
conditions; 

(iv) selection of only those synthesised VNTRs alleles that are 

5 common to all individuals manifesting a particular trait. This is achieved by 
strand dissociation and hybridization, giving rise to mis-match containing 
heteroduplexes of alleles at any locus that differ among the individuals. 
These complexes can be rejected by mis-match discrimination. The 
enriched alleles that are common to all individuals manifesting the trait are 
10 sufficiently pure to be used as starting material in other DNA based studies 
that utilise polymorphic alleles; 

(iv) rejection of those alleles common to all individuals 

manifesting a particular trait that are also common to individuals in which 
the trait is not present. This is achieved by strand dissociation and 

is hybridization of the VNTR alleles that are common to all individuals 

manifesting a particular trait of interest with the VNTR alleles of individuals 
in which the trait is not present followed by a second round of mis-match 
discrimination. In this case mis-match containing heteroduplexes are 
selected. These represent polymorphic VNTRs with an informative allele 

20 that co-segregates with the particular trait of interest. Amplification of 

these VNTRs from the DNAs of individuals manifesting the trait of interest 
yields the informative alleles that may be used as DNA markers. 

The invention provides a method of selecting genetic 
elements that are common to one pool of individuals but are absent in a 

25 second. An obvious variation on this theme is the selection of genetic 
elements that are absent in one pool of individuals but are present in a 
second by judicious selection, during the course of the procedure, of allele 
duplexes that are either with or without a mis-match. 
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1 . Recreation of the VNTRs with the flanking sequences 

Genomic DNA from an individual of the species in which an 
investigation is to be performed, but not necessarily an individual in that 
investigation, is digested by one or more restriction endonucleases to 
fragment the genome. 



Genomic DNA of species of interest 



Digest DNA to generate restriction 
fragments, including those which 
wil l contain VN TRs 

( IW ) 



The genomic fragments are ligated to adaptors that possess 
a recessed 3* end which is blocked chemically, or by other methods, to 
prevent strand extension in the presence of DNA polymerase. The 
adaptors may be blocked prior to ligation to the genomic fragments or after 
the ligation has occurred. 



Ligaie adaptors 



Adaptors ii gated and blocked (•) lo prevent filling in of 
the recessed strand. The adaptor primer only binds to the 
adaptor region when the binding site is created by the 
extcntion of s VNTR primer 



In the absence of strand extension there is no adaptor primer 
binding site and exponential amplification of the ligated fragment using the 
adaptor primer is not possible. However, when a chosen primer, with 
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homology to the ligated genomic fragment, anneals and extends to the limit 
. of the adaptor, an adaptor primer binding site is created. All products of 
amplification, therefore, are both ligated to the chosen adaptor and contain 
sequence homologous to that of the chosen primer. 

In separate reactions the adaptor primer is used with either a 
VNTR sense primer or a VNTR antisense primer to produce the 3' and 5' 
flanking sequences of each VNTR, respectively, by PCR, NASBA, or other 
methods. 



In one tube a VNTR primer | 
is annealed 



and extended, creating an adaptor primer 
binding site. This only happens if a VNTR is 
f present in the fragment. 



In a second tube a VNT R complementary 
sequence primer ) is annealed 



and extended, creating an adaptor primer 
binding site. This only happens if a VNTR is 
present in the fragment. 



The repeat (^HH ) primed product is 
amplified in the presence of both 
ihe repeat primer and adaptor primer 
to generate one Hank of the VNTR fragments 
* lacking VNTR sequence fail to amplify 



The anti-sense )primed product is 

amplified in the presence of both 
the repeat primer and adaptor primer 
to generate the other flank of the VNTR 
Fragments lacking VNTR sequence fail 
to amplify 



10 
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5' Flanking VNTR amplimer. 



3' Flanking VNTR amplimer 



The amplimers are purified from the VNTR primers by 
physical, chemical or enzymic means and size selection is performed to 
obtain amplimers of optimal size. The purified amplimers are then used to 
generate each VNTR allele present in the genomic DNA of one or more 
individuals manifesting a particular trait of interest. Where a pool of 
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Extention of the annealed 3' end bv 
polymerase. 
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Addition of the 3* amplimers, generated by the VNTR sense 
primer, with thermal cycling achieves successive rounds of strand 
dissociation and reannealing with the generation of each VNTR allele as 
present in the genomic DNA of those individuals under investigation. 




Amplification in the presence of adapter 
primer alone gives rise to a pool of 
VNTRs with their informativeness 
preserved. 




It is also possible to generate the full length VNTRs, with their 
flanking sequences, from the genomic DNA of individuals manifesting a 

10 particular trait by: 

(0 hybridization of genomic DNA of individuals manifesting a 

particular hereditary trait to amplimers representing the 3' region of each 
VNTR, created using the VNTR sense primer. The hybridization is 
incubated with DNA polymerase after which the antisense primed 

15 amplimers are added; 
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00 hybridization of genomic DNA of individuals manifesting a 

particular hereditary trait to both amplimer pools, simultaneously, created 
by the sense and antisense VNTR primers. The hybridization is then 
incubated with DNA polymerase. 



2. Mis-match discrimination of the generated VNTRs with flanking 
sequences from individuals under investigation. 

The VNTR alleles with flanking sequences generated from 
the genomic DNAs of individuals manifesting a particular trait, synthesized 
individually or collectively, are hybridized. If different alleles exist at any 
locus in the genomes of those individuals mis-match containing 
heteroduplex formation may occur. 



Alleles that are shared by all individuals Alleles that differ among individuals 



t 



Denature and anneal VNTRs with locus specific 
flanking sequences from affected and 
unaffected to produce heteroduplexes. J 





Mis-match containing heteroduplexes are discarded by mis- 
match discrimination resulting in the enrichment of those alleles common to 
all individuals manifesting the trait. In addition, this procedure removes the 
spurious products of amplification resulting from miss-priming events or 
reaction contamination since these spurious products will not be constant 
among all individuals. The mis-match discrimination may be achieved by: 
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(') partial digestion of the mis-match containing heteroduplexes 

and incubation with a labelled dNTP. Labelled duplexes that contained 
mis-matches may be bound to a support from which the mis-match free 
duplexes can be separated; 




Strand extention with hapten labelled 
nucleotides (*) and replacement of existing 
strand occurs only where alleles are 1 
not shared. These labelled duplexes are ' 
rejected and miss-match free duplexes are 
enriched 




(ii) complete digestion of the mis-match containing 

heteroduplexes and blockade of the 3' ends to prevent strand extension 
such that only those strands incorporated in mis-match free duplex 
formations can achieve exponential amplification and enrichment within the 
reaction. 
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) Complete cleavage at mismatch and blocking of the 
fragments (*) ensures that they do not participate in the 
subsequent amplification f 




The selected duplexes represent VNTR alleles that are 
common to all individuals manifesting a particular trait of interest. These 
5 alleles may be hybridized to the VNTR alleles generated from individuals in 
which that trait is not present. A portion of the alleles that are common to 
all individuals manifesting a trait of interest will form mis-match free 
heteroduplexes with the alleles from those in which the trait is absent. 
Since these are non-informative for the trait of interest they are rejected. 

10 Heteroduplexes containing a mis-match, however, are selected by partial 
digestion, and incubation with a labelled dNTP and DNA polymerase to 
allow separation of these alleles from the reaction. The selected 
heteroduplexes represent VNTRs that are linked to the particular trait of 
interest. Amplification of these VNTRs from DNA of individuals manifesting 

15 the trait yields the informative alleles which constitute a Total 
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Representation of Alleles Informative for a Irait (TRAIT). 




Strand extention with hapten labelled 
nucleotides (*) and replacement of existing 
strand occurs only where alleles are 1 
not shared. These labelled duplexes ? 
represent VNTRs linked to the trait of 
interest and are selected 




★ * ★ * ★ 




Alternatively, the selected alleles that are common to 
individuals manifesting a particular trait may be hybridized to an array of 
fixed alleles of known identity and spatial separation. By mis-match 
discrimination, the identity of alleles forming mis-match free duplexes can 
be established. 



Th preferred embodiment 

(0 Genomic DNA of an individual of the species under 
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investigation, but not necessarily an individual in that investigation is 
fragmented with a single restriction enzyme. 

00 Adaptors with a recessed 3' end are ligated to all genomic 

fragments. 

^ (HO All 3' ends are terminated by incubation of the ligated 

fragments with ddNTPs in the presence of Taq DNA polymerase. 
(«v) The ligated fragments are purified from the ddNTPs and 

amplified in reactions containing: 

a) adaptor primer and a (CA) n D primer, where D=G+A+T; 
' b) adaptor primer and an (AC) n B primer, where B=G + T + C; 

c) adaptor primer and a (GU) n H primer, where H=A + T+C; 

d) adaptor primer and a (UG) n V primer, where V-G+A+c! 

The products of amplification result from genomic fragments 
that successfully ligate to the chosen adaptor and contain a VNTR with 
homology to the chosen primer. 

M digestion of the (GU) n H primed products and the (UG) n V 

pnmed products with uracil-DNA g,ycosylase fol.owed by Exonuc.ease 1 to 
remove all VNTR sequences and excess VNTR primer. 
<*> digestion of (CA) n D primed products and (AC) n B primed 

products with Exonuclease 1 to remove all excess VNTR primer Size 
se.ect.on may be performed to obtain products of an optimal range of 
molecular weights. 

CN) Hybridization of equal amounts of (GU) n H primed products 

and the (UG) n V primed products wrth an equimolar quantity of pooled 
genomic DNAs derived from individuals manifesting a particular trait of 
interest. These DNAs act as template for generation of VNTR alleles 
contained within them. 

(viii) Incubation of the hybridized products with Taq DNA polymerase to 

achieve strand extension of all annealed 3' ends. 

fx) Addition of equal amounts of (CA) n D primed products and 
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(AC)nB primed products to the reaction and generation of VNTR alleles 
from the •genomic template' by thermal cycling in the presence of Taq DNA 
polymerase. 

00 Purification of the generated VNTR alleles followed by strand 

dissociation and reannealing under stringent conditions. 

( xi ) Partial digestion with S1 nuclease of mis-match containing 
heteroduplexes that result from hybridization of VNTR alleles to spurious 
products of amplification, or hybridization of VNTR alleles that differ among 
the individuals under investigation manifesting a particular trait of interest. 

(xii) Incubation of the hybridized alleles with biotin dUTP in the 
presence of Taq DNA polymerase such that the digested heteroduplexes 
incorporate biotin. 

(xiii) Separation of biotin containing molecules from the reaction by 
passage over streptavidin bound to a solid support. The remaining mis- 
match free duplexes are selected. 

(xiv) Repetition of hybridization, digestion and separation of 
heteroduplex molecules. This enriches the reaction in VNTR alleles that 
are common to all manifesting the particular trait of interest and removes 
any spurious products of amplification. 

(xv) Hybridization of the selected VNTR alleles, that are common 
to all individuals manifesting a particular trait, to the VNTR alleles of 
individuals in which the trait is absent that have been generated from their 
genomic DNAs in a method bearing similarity, wholly or in part, with (i) 

to (ix). 

(xvi) Partial digestion with S1 nuclease of mis-match containing 
heteroduplexes followed by incubation with biotin-dUTPs such that the 
digested heteroduplexes incorporate biotin. 

(xvii) Selection of biotin containing molecules from the reaction by 
passage over streptavidin bound to a solid support. These selected 
heteroduplexes represent VNTRs that are linked to the particular trait of 
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nteres . The .formative a,le,e of these VNTRs, with respect to the trait of 
-merest, can be established from the pool of alle.es common to all 
.nd,v,dua.s manifesting that trait by various methods including but not 
1'm.ted amplification using VNTR specific primers. 

5 

A second embodiment 

(') Genomic DNA of an individual of the species under 

-nvest.gation, but not necessarily an individual in that investigation is 
fragmented with a single restriction enzyme. 

° 00 Adaptors with a recessed 3' end are ligated to all genomic 

fragments. 

('») All 3' ends are terminated by incubation of the ligated 

fragments with ddNTPs in the presence of Taq DNA polymerase. 
M The ligated fragments are purified from the ddNTPs and 

amplified in reactions containing: 

a) adaptor primer and a (CA) n D primer, where D=G+A+T; 

b) adaptor primer and an (AC) n B primer, where B=G + T + C; 

c) adaptor primer and a (GU) n H primer, where H=A+T + C; 

d) adaptor primer and a (UG) n V primer, where V=G+A + C. 

The products of amplification result from genomic fragments 
hat successfully Ngated to the chosen adaptor and contain a VNTR with 
homology to the chosen primer. 

M digestion of the (GU)nH primed products and the (UG) n V 

pnmed products with uracil-DNA glycosylase followed by exonuclease 1 ,o 
remove all VNTR sequences and excess VNTR primer. 
<"> digestion of (CA) n D primed products and (AC) n B primed 

products with exonuclease 1 to remove a„ excess VNTR primer Size 
selection may be performed to obtain products Can optima, range of 
molecular weights. 

(vii) Hybridization of equal amounts of (GU) n H primed products 
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and the (UG) n V primed products with an equimolar quantity of pooled 
genomic DNAs derived from individuals manifesting a particular trait of 
interest. 

(viii) Incubation of the hybridized products with Taq DNA polymerase to 
achieve strand extension of all annealed 3' ends. 

(ix) Addition of equal amounts of (CA) n D primed products and 
(AC) n B primed products to the reaction and generation of VNTR alleles 
from 'genomic template' by thermal cycling in the presence of Taq DNA 
polymerase. 

00 Purification of the generated VNTR alleles followed by strand 

dissociation and reannealing under stringent conditions. 

(xi) Partial digestion with S1 nuclease of mis-match containing 

heteroduplexes that result from hybridization of VNTR alleles to spurious 

products of amplification, and hybridization of VNTR alleles that differ 

among the individuals under investigation manifesting a particular trait of 

interest. 

(x>>) Incubation of the hybridized alleles with biotin dUTP in the 

presence of Taq DNA polymerase such that the digested heteroduplexes 
incorporate biotin. 

(xiii) Separation of biotin containing molecules from the reaction by 
passage over streptavidin bound to a solid support. The remaining mis- 
match free duplexes are selected. 

(xiv) Repetition of hybridization, digestion and separation of 
heteroduplex molecules. This enriches the reaction in VNTR alleles that 
are common to all manifesting the particular trait of interest and removes 
any spurious products of amplification. 

(xv) Hybridization of the alleles common to all individuals 
manifesting a particular trait of interest to a fixed array of all possible VNTR 
alleles, or subset thereof, on a solid support of known identity and spatial 
separation under stringent conditions. 



-26- 



(xvi) Partial digestion with S1 nuclease of mis-match containing 
heteroduplexes followed stringent washing of the array to remove all 
unbound and digested strands. 

(xvii) Detection of alleles forming mis-match free duplexes by 
observation of fluorescence, providing the VNTR alleles fixed to the array 
are appropriately labelled. The identity of these alleles is established if the 
identity and spatial separation of the alleles fixed on the array is known. 

A third embodiment 

(') VNTR alleles are generated by means other than processes 

of amplification of fragmented and ligated genomic DNA with adaptor 
primer and VNTR primer, hybridization of the generated products to 
genomic 'template' DNAs of individuals manifesting a particular trait, and 
generation of the respective VNTR alleles from those template DNAs. 
These may include but are not limited to: 

a) amplification of VNTRs from genomic or synthetic DNA using 
primers specific to the flanking regions of each VNTR in individual 
reactions; 

b) amplification of VNTRs from genomic or synthetic DNA using a 
multiplex system, thereby allowing amplification of multiple VNTRs on 
mass using adapted VNTR specific primers; 

c) amplification of VNTRs from genomic or synthetic DNA using an 
endonuclease that cleaves in or about VNTR sequences such that 
adaptors may be ligated to the digested DNA and used for amplification of 
the VNTR alleles; 

d) generation of a pool of VNTRs from individuals manifesting a 
particular trait by processes of subtraction with those in which the trait is 
absent. 

('■) Purification of the generated VNTR alleles followed by strand 

dissociation and reannealing under stringent conditions. 
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('") Partial digestion with S1 nuclease of mis-match containing 

heteroduplexes that result from hybridization of VNTR alleles to spurious 
products of amplification, and hybridization of VNTR alleles that differ 
among the individuals under investigation manifesting a particular trait of 
interest. 

( iv ) Incubation of the hybridized alleles with biotin dUTP in the 
presence of Taq DNA polymerase such that the digested heteroduplexes 
incorporate biotin. 

( v ) Separation of biotin containing molecules from the reaction 
by passage over streptavidin bound to a solid support. The remaining mis- 
match free duplexes are selected. 

(vi) Repetition of hybridization, digestion and separation of 
heteroduplex molecules. This enriches the reaction in VNTR alleles that 
are common to all manifesting the particular trait of interest and removes 
any spurious products of amplification. 

(xv) Hybridization of the alleles common to all individuals 
manifesting a particular trait of interest to a fixed array of all possible VNTR 
alleles, or subset thereof, on a solid support of known spatial separation 
under stringent conditions. 

(xvi) Partial digestion with S1 nuclease of heteroduplexes followed 
stringent washing of the array to remove all unbound strands and digested 
molecules. 

(xvii) Detection of alleles^forming mis-match free duplexes by 
observation of fluorescence, providing the VNTR alleles fixed to the array 
are appropriately labelled. The identity of these alleles is established if the 
identity and spatial separation of the alleles fixed on the array is known. 

Those that are skilled in the art will appreciate that there are 
several methods of differentiating mis-match containing duplexes from 
those that are free of mis-matches, either in solution or on an array. The 
methods described in the above embodiments represent only one of these 



-28- 



methods. 

Those that are skilled in the art will appreciate that the 
invention is equally well suited any type of VNTR including but not 
restricted to dinucleotide repeats e.g.(CA) n and (GT) n , trinucleotide 
repeats e.g.(AAT) n , (AGC)n, (AGG) n , (CAC) n> (CCG) n and (CTT) n , and 
tetranucleotide repeats e.g.(CCTA) n , (CTGT) n , (CTTT) n .(TAGG) n , 
(TCTA)n, and (TTCC) n . In addition, the invention may be applied to simple 
organism microsatellites that include, but are not limited to, (AT), (CC), 
(CT) and (GA) rich tracts of repetitive motifs. 

Those that are skilled in the art will appreciate that polymorphic alleles, 
other than those of VNTRs, may be used with the invention to produce 
alleles that are free of spurious products of amplification and are common 
to all individuals manifesting a particular trait. These polymorphic alleles 
may be hybridized to a fixed array of all possible alleles, or subset thereof, 
or to a pool of alleles derived from individuals in which that trait is absent. 
By mis-match discrimination using S1 nuclease and incorporation of biotin, 
those alleles linked and informative for a trait can be identified. 

Those that are skilled in the art will appreciate that alleles from 
the genome of a single individual, or more than one individual, of unknown 
phenotype and genotype may be amplified with fidelity, removing the 
spurious products of amplification by mis-match discrimination, and 
hybridized to a fixed array of alleles, or to a pool of alleles in solution, in 
order assign a genotype or a phenotype to that individual. 

Those that are skilled in the art will appreciate that mis-match 
discrimination may be performed using enzymes or chemicals other than 
SI nuclease. These alternatives include but are not limited to Mung Bean 
nuclease, mutation detection proteins (e.g. Mut S), recombinases (e.g. 
Endonuclease VII), osmium tetroxide and hydroxylamine. 

Those that are skilled in the art will appreciate that haptens 
other than biotin, using chemicals with affinity to the chosen hapten bound 
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on a solid support, may be used to achieve separation of duplexes with or 
without mis-matches after mis-match discrimination. 

The invention differs fundamentally from all previous 
techniques since genomic fragments are generated that do not reflect the 
polymorphic variation at the locus from which they were derived. 
Furthermore, these fragments need not be generated from an individual in 
a particular investigation, but may be from any individual of the appropriate 
species. However, hybridization of these fragments to genomic •template* 
DNA of an individual subject to investigation and mis-match discrimination 
permits amplification, with fidelity, of alleles within that genomic template 
whilst overcoming the problems of generation of spurious products that are 
a feature of other PCR-based methods. Since the invention preserves 
each VNTR allele with its flanking sequences, these alleles remain highly 
informative. In this respect the invention is unique. Furthermore, this novel 
method of generating VNTRs is rapid, inexpensive, has no requirement for 
prior knowledge of sequence, and has no requirement for elaborate 
equipment, it is of immense importance obviating the high investment of 
time and money that is currently required for isolation of VNTRs. 
Consequently, the application of technologies dependant on the availability 
of VNTR in species in which none have been isolated will be possible 
where previously this was unfeasible. The ability to generate large 
numbers of VNTRs from all species quickly, efficiently, cheaply and with 
fidelity is a considerable contribution of the present invention to workers in 
the to the biomedical field. 

In summary, the invention involves a novel method of 
generating VNTRs encompassing restriction endonuclease digestion of 
DNA, ligation of the fragments to adaptors and, by introduction of a primer 
with sequence homology to a chosen VNTR, amplifying only those 
fragments that are flanked by a chosen endonuclease restriction enzyme 
site and a VNTR. These fragments are not representative of the alleles of 
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each VNTR and need not be generated from any specific individual under 
investigation. Hybridization of these fragments with genomic DNA of the 
individuals under investigation recreates the intact VNTR alleles with 
flanking sequence, as they occur in the genome. This in itself constitutes a 
major step in the ability of workers in the biomedical fields to generate 
quickly, efficiently, cheaply and with fidelity VNTRs in all species for 
purposes reliant on the availability of VNTRs, including but not confined to 
DNA fingerprinting and linkage analysis. The incorporation of a mis-match 
discrimination procedure overcomes the problems of miss-priming and 
generation of spurious products by reaction contamination and subtle 
variation in reaction conditions, that are to the detriment of all PCR-based 
technologies, and allows exclusion of alleles that are not common to all 
individuals under investigation that manifest a particular trait. A second 
round of mis-match discrimination removes un-informative alleles that are 
present in the genomes of individuals that do not manifest the trait. This 
procedure is designated a Total Representation of Alleles that are 
Informative for a Irait (TRAIT). The invention, therefore, has significant 
advantages over previous methods, embracing the speed of analysis of 
AFLP, RDA and RAPD, and the high polymorphism detection rate of 
linkage analysis, but negating the need for DNA from closely related 
individuals and for paternity testing. The invention also overcomes 
fundamental problems that are a feature of PCR based technologies, 
including miss-priming and generation of spurious products through 
reaction contamination and subtle variations in the conditions of reaction. 
Furthermore, there is no requirement for expensive equipment or elaborate 
statistical computer software. The analysis will give rise to alleles that are 
both linked and informative, being present in individuals manifesting the 
trait of interest but absent in those individuals that lack the trait. In this 
respect, the invention is unchallenged in its superiority over all other 
methods. 
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The invention allows concomitant detection of polymorphisms 
at multiple loci by simultaneous comparison of simple or complex genomes 
from multiple individuals and differs fundamentally from all other 
techniques that have been previously employed. The invention represents 
a major advance in the ability of workers in the biomedical fields to 
generate VNTRs from the genomes of any species quickly, efficiently, 
cheaply and with fidelity in addition to screening complex genomes for 
polymorphisms co-segregating with hereditary traits. Application of this 
procedure will therefore facilitate the development of markers for genetic 
screening for hereditary disease, or advantageous monogenic or polygenic 
traits in all organisms. 

Examples of how the invention may be applied 

The following illustrations represent examples of how the 
invention may be applied without inferring any limitation to scope of the 
invention or any limitation to the different ways in which the invention may 
be applied. 

Example 1 

In this example the (CA) n /(GT) n VNTRs are generated from 
genomic DNA fragmented by a single restriction enzyme. The amplified 
products are hybridized to two pools of genomic DNA. One pool is derived 
from individuals manifesting a particular hereditary trait. The second pool 
is derived from individuals in which that trait is absent. Mis-match 
discrimination is performed solely by partial digestion of mis-match 
containing heteroduplexes. 

(0 restriction digestion of genomic DNA using Rsa1 buffered 

appropriately at 37°C for several hours to yield genomic fragments. 
00 Annealing of the sense and antisense oligonucleotides that 

comprise the adaptor (a 48mer and 12mer respectively). This may be 
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achieved by mixing equimolar quantities of each oligo in ligase buffer and 
heating to 50°C followed by gradual cooling to 10°C over 1 hour. 
('") Ligating the adaptor to the genomic fragments by addition of 

the fragments to the buffered adaptor and incubating at 1 5°C over night in 
the presence of T4 DNA ligase. 

(iv) Removal of the ligase buffer and incubation of the ligated 
fragments with dideoxynucleotide triphosphates (ddNTPs) in the presence 
of Taq DNA polymerase appropriately buffered at 72°C for 1 hour. 
Complete termination of the recessed 3' ends of the adaptor can be 
assessed by PCR of an aliquot of the reaction with adaptor primer which 
should fail to yield any products of amplification. 

( v ) Purification of the ligated fragments from the ddNTPs and 
amplification of those ligated fragments containing a (CA) n /(GT) n VNTR 
under appropriate conditions with either a (CA) n primer and the adaptor 
primer, or a (GT) n primer and the adaptor primer. Over amplification of 
products should be avoided to limit the opportunity for formation of 
concatamers. Using PCR a total of about 25 cycles may be appropriate, 
though NASBA or other methods may be employed to achieve the same 
goal. 

(vi) The (CA) n primers and the (GT) n primers are destroyed by 
physical separation of molecules based on molecular weight, 
electrophoretic mobility, or by digestion of single stranded DNA using an 
appropriate enzymes such as Exonclease 1 appropriately buffered at 37°C. 

(vii) hybridization of the (CA) n primed products to an equimolar 
quantity of the pooled genomic DNAs of all individuals under investigation 
manifesting the particular trait of interest. Hybridization should proceed 
under stringent conditions of high temperature and low salt concentration. 
Strand dissociation at 98°C for 5 minutes followed by a step-wise decrease 
in temperature to 78°C over several hours in solution of limited volume 
containing 0.2M NaCI may be appropriate. 
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(viii) incubation of the hybridization components with Taq DNA 
polymerase appropriately buffered at 72°C to achieve extension of 
annealed strands. 

(ix) addition of an amount of (GT) n primed products, similar to 
5 that of the (CA) n primed products, and adaptor primer to the reaction 

followed by amplification of products to yield VNTR alleles faithfully 
reproduced from the genomic templates of individuals manifesting the trait 
of interest. This may be performed by PCR, NASBA or other methods. 

(x) dissociation of the amplified VNTR alleles by incubation at 
10 98°C for 5 minutes then step-wise reduction in temperature to 78°C over 

several hours in a limited reaction volume containing 0.2M NaCI. This 
allows the alleles at each locus to hybridize to the alleles derived from 
other individuals resulting in mis-match free duplex formation where alleles 
are in common, or duplexes with a mis-match where alleles differ among 
is individuals. 

(xi) Incubation of the hybridized strands with S1 nuclease under 
appropriate conditions to achieve partial digestion of all mis-match 
containing duplex molecules. 

(xii) incubation of the reaction components with biotin-dUTP 

20 containing dNTPs in the presence of Taq DNA polymerase appropriately 
buffered at 72°C for 1 hour to achieve extension of the 3* ends of all 
digested strands such that mis-match containing duplexes incorporate 
biotin. 

(xiii) physical separation of biotin containing duplexes due by 
25 binding to streptavidin on a rigid support. 

(xiv) repeated dissociation and reannealing of the reaction 
components followed by heteroduplex digestion and biotin incorporation to 
permits enrichment of alleles common to all individuals manifesting the trait 
of interest. 

30 (xv) amplification of the VNTR alleles of the pool of individuals in 
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which the trait of interest in absent, as described in (i) to (be), 
(xvl) hybridization of the enriched VNTR alleles forming mis-match 

free duplexes, that are common to all individuals manifesting a the trait of 
•nterest, with the VNTR alleles derived from those individuals in which the 
> trart » absent. This may be achieved by strand dissociation at 98°C for 5 
m.nutes then step-wise reduction in temperature over several hours to 
78°C in a limited reaction volume containing 0.2M NaCI. 

(xvii) partial digestion of the mis-match containing duplexes with 
S1 nuclease under appropriate conditions. 

(xviii) incubation of the reaction components with biotin-dUTP 
containing dNTPs in the presence of Taq DNA polymerase appropriately 
buffered at 72°C for 1 hour. 

(xix) separation of the biotin containing duplex molecules formed 
by the association of alleles common to individuals manifesting the trait of 
-nterest that are absent in those in which the trait is not present. The biotin 
eonta,n,ng molecules are selected and represent VNTRs that are linked to 
the trait of interest. The informative alleles of these VNTRs can be 
amplified from the DNAs of individuals manifesting the trait of interest. 

Example 2 

In this example the (CA) n /(GT) n VNTRs are generated from 
genomic DNA fragmented by a single restriction enzyme. The amplified 
products are hybridized to two pools of genomic DNA. One pool is derived 
from .ndividuals manifesting a particular hereditary trait. The second pool 
is derived from individuals in which that trait is absent. Mis-match 
discrimination among all individuals manifesting the particular trait of 
-nterest is performed by complete digestion of the mis-match containing 
duplexes followed by enrichment of mis-match free duplexes by 
amplification. The amplified products represent alleles that a common to 
all .nd.v.duals manifesting the trait of interest. This is a less preferred 
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embodiment since the amplification of the selected alleles may yield 
spurious products by miss-priming events or reaction contamination. 
However, hybridization of these alleles to the alleles derived from 
individuals in which the trait is absent followed by partial digestion of any 
5 mis-match containing duplexes may be performed. 

(i) restriction digestion of genomic DNA using Rsa1 buffered 
appropriately at 37°C for several hours to yield genomic fragments. 

(ii) Annealing of the sense and antisense oligonucleotides that 
comprise the adaptor (a 48mer and 12mer respectively). This may be 

10 achieved by mixing equimolar quantities of each oligo in ligase buffer and 
heating to 50°C followed by gradual cooling to 10°C over 1 hour. 

(iii) Ligating the adaptor to the genomic fragments by addition of 
the fragments to the buffered adaptor and incubating at 15°C over night in 
the presence of T4 DNA ligase. 

15 (iv) Removal of the ligase buffer and incubation of the ligated 

fragments with dideoxynucleotide triphosphates (ddNTPs) in the presence 
of Taq DNA polymerase appropriately buffered at 72°C for 1 hour. 
Complete termination of the recessed 3' ends of the adaptor can be 
assessed by PCR of an aliquot of the reaction with adaptor primer which 

20 should fail to yield any products of amplification. 

(v) Purification of the ligated fragments from the ddNTPs and 
amplification of those ligated fragments containing a (CA) n /(GT) n VNTR 
under appropriate conditions wittreither a (CA) n primer and the adaptor 
primer, or a (GT) n primer and the adaptor primer. Over amplification of 

25 products should be avoided to limit the opportunity for formation of 

concatamers. Using PCR a total of about 25 cycles may be appropriate, 
though NASBA or other methods may be employed to achieve the same 
goal. 

(vi) The (CA) n primers and the (GT) n primers are destroyed by 
30 physical separation of molecules based on molecular weight, 
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electrophoretic mobility, or by digestion of single stranded DNA using an 
appropriate enzymes such as Exonclease 1 appropriately buffered at 37°C. 
(vii) hybridization of the (CA) n primed products to an equimolar 

quantity of the pooled genomic DNAs of all individuals under investigation 
manifesting the particular trait of interest. Hybridization should proceed 
under stringent conditions of high temperature and low salt concentration. 
Strand dissociation at 98°C for 5 minutes followed by a step-wise decrease 
in temperature to 78°C over several hours in solution of limited volume 
containing 0.2M NaCI may be appropriate. 

(vi'O incubation of the hybridization components with Taq DNA 

polymerase appropriately buffered at 72°C to achieve extension of 
annealed strands. 

( ix ) addition of an amount of (GT) n primed products, similar to 
that of the (CA) n primed products, and adaptor primer to the reaction 
followed by amplification of products to yield VNTR alleles faithfully 
reproduced from the genomic templates of individuals manifesting the trait 
of interest. This may be performed by PCR, NASBA or other methods. 

( x ) dissociation of the amplified VNTR alleles by incubation at 
98°C for 5 minutes then step-wise reduction in temperature to 78°C over 
several hours in a limited reaction volume containing 0.2M NaCI. This 
allows the alleles at each locus to hybridize to the alleles derived from 
other individuals resulting in mis-match free duplex formation where alleles 
are in common, or duplexes containing a mis-match where alleles differ 
among individuals. 

( xi ) Incubation of the hybridized strands with S1 nuclease under 
appropriate conditions to achieve complete digestion of all mis-match 
duplexes. 

( x ) purification of the reaction components and incubation with 

ddUTP containing dNTPs in the presence of Taq DNA polymerase at 72°C 
for 1 hour to block all 3' ends of the annealed strands. 
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(xi) purification of the reaction components and amplification of 
mis-match free duplexes by PGR, NASBA, or other methods. 

(xii) repeated dissociation and reannealing of the reaction 
components followed by heteroduplex digestion, and incubation with 

5 ddUTP containing dNTPs to achieve enrichment of alleles common to all 
individuals manifesting the trait of interest by PCR, NASBA or other 
methods. 

(xiii) amplification of the VNTR alleles of the pool of individuals in 
which the trait of interest in absent, as described in (i) to (ix). 

10 (xiv) hybridization of the enriched VNTR alleles that are common 

to all individuals manifesting a the trait of interest with the VNTR alleles 
derived from those individuals in which the trait is absent. This may be 
achieved by strand dissociation at 98°C for 5 minutes then step-wise 
reduction in temperature over several hours to 78°C in a limited reaction 

15 volume containing 0.2M NaCI. 

(xv) partial digestion of the mis-match containing duplexes with 

SI nuclease under appropriate conditions. 

(xviii) incubation of the reaction components with biotin-dUTP 
containing dNTPs in the presence of Taq DNA polymerase appropriately 

20 buffered at 72°C for 1 hour. 

(xix) separation of the biotin containing duplex molecules formed 
by the association of alleles that differ between individuals manifesting the 
trait of interest and those in which the trait is absent. The biotin containing 
molecules are selected and represent VNTRs that are linked to the trait of 

25 interest. The informative alleles of these VNTRs can be amplified from the 
DNAs of individuals manifesting the trait of interest. 
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Example 3 

In this example the (CA) n /(GT) n VNTRs are generated from 
genomic DNA fragmented by a single restriction enzyme. The amplified 
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products are hybridized to the genomic 'template' DNA of a single 
individual manifesting a particular trait of interest to yield the VNTR alleles 
represented in that genomic 'template' DNA. By hybridization of the 
amplified VNTR alleles to a sample of the same genomic 'template' DNA, 
partial digestion of any mis-match containing duplexes, and incubation of 
the products with biotin-dUTP, the spurious products of amplification due to 
reaction contamination and miss-priming events are removed. The non- 
biotin containing alleles forming mis-match free duplexes with the genomic 
'template' DNA are hybridized to the pooled VNTR alleles of individuals 
that do not manifest the trait of interest. Heteroduplexes containing mis- 
matches formed in this hybridization are selected by partial digestion and 
incubation of biotin-dUTP in the presence of Taq DNA polymerase. 
(') restriction digestion of genomic DNA using Rsa1 buffered 

appropriately at 37°C for several hours to yield genomic fragments. 
(») Annealing of the sense and antisense oligonucleotides that 

comprise the adaptor (a 48mer and 12mer respectively). This may be 
achieved by mixing equimolar quantities of each oligo in ligase buffer and 
heating to 50°C followed by gradual cooling to 10°C over 1 hour. 
("') Ligating the adaptor to the genomic fragments by addition of 

the fragments to the buffered adaptor and incubating at 15°C over night in 
the presence of T4 DNA ligase. 

(iv) Removal of the ligase buffer and incubation of the ligated 
fragments with dideoxynucleotide triphosphates (ddNTPs) in the presence 
of taq DNA polymerase appropriately buffered at 72°C for 1 hour. 
Complete termination of the recessed 3" ends of the adaptor can be 
assessed by PCR of an aliquot of the reaction with adaptor primer which 
should fail to yield any products of amplification. 

(v) Purification of the ligated fragments from the ddNTPs and 
amplification of those ligated fragments containing a (CA) n /(GT) n VNTR 
under appropriate conditions with either a (CA) n primer and the adaptor 
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primer, or a (GT) n primer and the adaptor primer. Over amplification of 
products should be avoided to limit the opportunity for formation of 
concatamers. Using PCR a total of about 25 cycles may be appropriate, 
though NASBA or other methods may be employed to achieve the same 
5 goal. 

(vi) The (CA) n primers and the (GT) n primers are destroyed by 

physical separation of molecules based on molecular weight, 
electrophoretic mobility, or by digestion of single stranded DNA using an 
appropriate enzymes such as Exonclease 1 appropriately buffered at 37°C. 

io (vii) hybridization of the (CA) n primed products to an equimolar 

quantity of the genomic 'template 1 DNA of the single individual under 
investigation manifesting the particular trait of interest. Hybridization 
should proceed under stringent conditions of high temperature and low salt 
concentration. Strand dissociation at 98°C for 5 minutes followed by a 

15 step-wise decrease in temperature to 78°C over several hours in solution 
of limited volume containing 0.2M NaCI may be appropriate, 
(viii) incubation of the hybridization components with taq DNA 

polymerase appropriately buffered at 72°C to achieve extension of 
annealed strands. 

20 (ix) addition of an amount of (GT) n primed products, similar to 

that of the (CA) n primed products, and adaptor primer to the reaction 
followed by amplification of products to yield VNTR alleles faithfully 
reproduced from the genomic template of the individual manifesting the 
trait of interest. This may be performed by PCR, NASBA or other methods. 

25 (x) addition of the VNTR alleles to a molar excess of the same 

genomic 'template 1 DNA from which the alleles were derived followed by 
dissociation of all strands by incubation at 98°C for 5 minutes then step- 
wise reduction in temperature to 78°C over several hours in a limited 
reaction volume containing 0.2M NaCI. This allows the alleles to hybridize 

30 to the appropriate locus on the genomic DNA. Any mis-match containing 



-40- 



duplexes .ha, may occur result from miss-priming events or reaction 

contamination. 

W> Partial digestion of the mis-match containing duplexes using 

S 1 nuclease under appropriate conditions. 

5 (xii) incubation of the reaction components with biotin-dUTP in the 

presence of Taq DNA polymerase appropriately buffered at 72>C for 1 

hour. 

m separation of the biotin-containing molecules by binding to 

streptavidin on a solid support. 

' (xiv) repeated hybridization of the VNTR alleles to the same 

genomic template' DNA followed by partial digestion of mis-match 
conta.ning duplex molecules and incubation with biotin-dUTP for removal 
of a„ spurious products of amplification by binding to streptavidin on a soiid 
support. 

<*v) amplification of the VNTR alleles derived from the pool of 

individual in which the trai, o, interest in absent, as described in (i, to (ix, 
<«<> hybridization of the VNTR aNeles. that represent products 

wh,ch have amplified with fidelity from the individual manifesting the 
part.cular trait of interest, to the poo, of VNTR alleles derived from 
individuals in which the trai, is no. present This may be achieved by 
strand dissocialion a. 98-C for 5 minutes then step-wise reduction in 
temperature over several hours to 78-C in a limited reaction volume 
containing 0.2M NaCI. 

(xvii) partial digestion of the mis-match containing duplexes with 

£>1 nuclease under appropriate conditions. 

<*ix) incubation of the reaction components with biotin-dUTP 

contain^ dNTPs in the presence of Ta q DNA polymerase appropriately 
buffered at 72°C for 1 hour. 

(«) separation of the biotin containing dup.ex molecules formed 

by the association of al.eles that differ between individual manifesting the 
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trait of interest and those in which the trait is absent. The biotin containing 
molecules are selected and may be further amplified. These selected 
alleles are present in the genome of the individual manifesting the trait of 
interest but are absent in those in which the trait is not present. 

Example 4 

In this example the (CA) n /(GT) n VNTRs are generated from 
genomic DNA fragmented by a single restriction enzyme. The amplified 
products are hybridized to a pool of genomic DNA derived from individuals 
manifesting a particular hereditary trait. Mis-match discrimination is 
performed by partial strand dissociation and reannealing, followed by 
partial digestion of mis-match containing duplexes by S1 nuclease. 
Digested duplexes are removed by biotin incorporation and binding of 
streptavidin. The alleles forming mis-match free duplexes are selected and 
hybridized to an array of VNTR alleles of known identity and spatial 
separation that are derived from the genomes of individuals in which the 
trait is absent. Providing the VNTR alleles fixed to the array are 
appropriately labelled, the identity of the alleles forming mis-match free 
duplexes with the array may be established. 

(') restriction digestion of genomic DNA using Rsa1 buffered 

appropriately at 37°C for several hours to yield genomic fragments. 

(") Annealing of the sense and antisense oligonucleotides that 

comprise the adaptor (a 48mer and 12mer respectively). This may be 

achieved by mixing equimolar quantities of each oligo in ligase buffer and 

heating to 50°C followed by gradual cooling to 10°C over 1 hour. 

("') Ligating the adaptor to the genomic fragments by addition of 

the fragments to the buffered adaptor and incubating at 15°C over night in 

the presence of T4 DNA ligase. 

(iv) Removal of the ligase buffer and incubation of the ligated 

fragments with dideoxynucleotide triphosphates (ddNTPs) in the presence 
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of taq DNA polymerase appropriately buffered at 72°C for 1 hour. 
Complete termination of the recessed 3 1 ends of the adaptor can be 
assessed by PCR of an aliquot of the reaction with adaptor primer which 
should fail to yield any products of amplification. 

5 (v) Purification of the ligated fragments from the ddNTPs and 

amplification of those ligated fragments containing a (CA) n /(GT) n VNTR 
under appropriate conditions with either a (CA) n primer and the adaptor 
primer, or a (GT) n primer and the adaptor primer. Over amplification of 
products should be avoided to limit the opportunity for formation of 

10 concatamers. Using PCR a total of about 25 cycles may be appropriate, 
though NASBA or other methods may be employed to achieve the same 
goal. 

(vi) The (CA) n primers and the (GT) n primers are destroyed by 
physical separation of molecules based on molecular weight, 

is electrophoretic mobility, or by digestion of single stranded DNA using an 
appropriate enzymes such as Exonclease 1 appropriately buffered at 37°C. 

(vii) hybridization of the (CA) n primed products to an equimolar 
quantity of the pooled genomic DNAs of all individuals under investigation 
manifesting the particular trait of interest. Hybridization should proceed 

20 under stringent conditions of high temperature and low salt concentration. 
Strand dissociation at 98°C for 5 minutes followed by a step-wise decrease 
in temperature to 78°C over several hours in solution of limited volume 
containing 0.2M NaCl may be appropriate. 

(viii) incubation of the hybridization components with Taq DNA 
25 polymerase appropriately buffered at 72°C to achieve extension of 

annealed strands. 

(ix) addition of an amount of (GT) n primed products, similar to 
that of the (CA) n primed products, and adaptor primer to the reaction 
followed by amplification of products to yield VNTR alleles faithfully 

so reproduced from the genomic templates of individuals manifesting the trait 
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of interest. This may be performed by PCR, NASBA or other methods. 
M dissociation of the amplified VNTR alleles by incubation at 

98°C for 5 minutes then step-wise reduction in temperature to 78°C over 
several hours in a limited reaction volume containing 0.2M NaCI. This 
allows the alleles at each locus to hybridize to the alleles derived from 
other individuals resulting in mis-match free duplex formation where alleles 
are in common, or duplexes containing mis-matches where alleles differ 
among individuals. 

( xi ) Incubation of the hybridized strands with S1 nuclease under 
appropriate conditions to achieve partial digestion of all mis-match 
containing duplex molecules. 

(xii) incubation of the reaction components with biotin-dUTP 
containing dNTPs in the presence of Taq DNA polymerase appropriately 
buffered at 72°C for 1 hour to achieve extension of the 3' ends of all 
digested strands such that digested duplexes incorporate biotin. 

(xiii) physical separation of biotin containing duplexes due by 
binding to streptavidin on a rigid support. 

(xiv) repeated dissociation and reannealing of the reaction 
components followed by digestion and biotin incorporation to permits 
enrichment of alleles common to all individuals manifesting the trait of 
interest. 

(xv) hybridization of the selected alleles forming mis-match free 
duplexes with the alleles of individuals in which the trait is absent that are 
fixed to a solid support with known spatial separation. This may be 
performed by addition of the VNTR alleles from individuals manifesting the 
trait of interest, that have been selected by mis-match discrimination, to the 
array of fixed alleles and incubating at 98°C for 5 minutes followed by step- 
wise decrease in temperature to 78°C over several hours in a limited 
volume containing 0.2M NaCI held on the array under a coverslip. 

(xvi) addition of S1 nuclease and partial digestion of all mis-match 
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containing heterocluplexes, under appropriate conditions. 

(xvii) stringent washing of the array to remove strands that failed to 
hybridize to the array and duplexes that have been digested This may be 
achieved by washing with SSC of low molar concentration at high 

5 temperature. 

(xviii) detection of those alleles that formed mis-match free 
duplexes with the array and remain intact. This may be achieved by 
observation of fluorescence providing VNTR alleles bound to the array are 
appropriately labelled. 

10 

Example 5 

In this example the (CA) n /(GT) n VNTRs are generated from 
genomic DNA fragmented by a single restriction enzyme. The amplified 
products are hybridized to a pool of genomic DNA derived from individuals 

15 manifesting a particular hereditary trait. Mis-match discrimination is 
performed by partial strand dissociation and reannealing, followed by 
partial digestion of mis-match containing duplexes by S1 nuclease. 
Digested duplexes are removed by biotin incorporation and binding of 
streptavidin. The alleles forming mis-match free duplexes are selected and 

20 hybridized to an array of VNTR alleles of known identity and spatial 

separation in which all possible alleles are represented, or subset thereof. 
Providing the VNTR alleles that are fixed to the array are appropriately 
labelled the identity of the alleles'forming mis-match free duplexes with the 
array may be established. 

25 (i) restriction digestion of genomic DNA using Rsa1 buffered 

appropriately at 37°C for several hours to yield genomic fragments, 
(ii) Annealing of the sense and antisense oligonucleotides that 

comprise the adaptor (a 48mer and 12mer respectively). This may be 
achieved by mixing equimolar quantities of each oligo in ligase buffer and 

30 heating to 50°C followed by gradual cooling to 10°C over 1 hour. 
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('») Ligating the adaptor to the genomic fragments by addition of 

the fragments to the buffered adaptor and incubating at 15°C over night in 
the presence of T4 DNA ligase. 

(iv) Removal of the ligase buffer and incubation of the ligated 
fragments with dideoxynucleotide triphosphates (ddNTPs) in the presence 
of taq DNA polymerase appropriately buffered at 72°C for 1 hour. 
Complete termination of the recessed 3' ends of the adaptor can be 
assessed by PCR of an aliquot of the reaction with adaptor primer which 
should fail to yield any products of amplification. 

(v) Purification of the ligated fragments from the ddNTPs and 
amplification of those ligated fragments containing a (CA) n /(GT) n VNTR 
under appropriate conditions with either a (CA) n primer and the adaptor 
primer, or a (GT) n primer and the adaptor primer. Over amplification of 
products should be avoided to limit the opportunity for formation of 
concatamers. Using PCR a total of about 25 cycles may be appropriate, 
though NASBA or other methods may be employed to achieve the same 
goal. 

( vi ) The (CA) n primers and the (GT) n primers are destroyed by 
physical separation of molecules based on molecular weight, 
electrophoretic mobility, or by digestion of single stranded DNA using an 
appropriate enzymes such as Exonclease 1 appropriately buffered at 37°C. 

( vii ) hybridization of the (CA) n primed products to an equimolar 
quantity of the pooled genomic DNAs of all individuals under investigation 
manifesting the particular trait of interest. Hybridization should proceed 
under stringent conditions of high temperature and low salt concentration. 
Strand dissociation at 98°C for 5 minutes followed by a step-wise decrease 
in temperature to 78°C over several hours in solution of limited volume 
containing 0.2M NaCI may be appropriate. 

(viii) incubation of the hybridization components with Taq DNA 
polymerase appropriately buffered at 72°C to achieve extension of 
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annealed strands. 

(ix) addition of an amount of (GT) n primed products, similar to 
that of the (CA) n primed products, and adaptor primer to the reaction 
followed by amplification of products to yield VNTR alleles faithfully 
reproduced from the genomic templates of individuals manifesting the trait 
of interest. This may be performed by PCR, NASBA or other methods. 

(x) dissociation of the amplified VNTR alleles by incubation at 
98°C for 5 minutes then step-wise reduction in temperature to 78°C over 
several hours in a limited reaction volume containing 0.2M NaCI. This 
allows the alleles at each locus to hybridize to the alleles derived from 
other individuals resulting in mis-match free duplexes where alleles are in 
common, or duplexes containing a mis-match where alleles differ among 
individuals. 

(xi) Incubation of the hybridized strands with S1 nuclease under 
appropriate conditions to achieve partial digestion of all mis-match 
containing duplex molecules. 

(xii) incubation of the reaction components with biotin-dUTP 
containing dNTPs in the presence of Taq DNA polymerase appropriately 
buffered at 72°C for 1 hour to achieve extension of the 3' ends of all 
digested strands such that digested duplexes incorporate biotin. 

(xiii) physical separation of biotin containing duplexes due by 
binding to streptavidin on a rigid support. 

(xiv) repeated dissociation and reannealing of the reaction 
components followed by heteroduplex digestion and biotin incorporation to 
permits enrichment of alleles common to all individuals manifesting the trait 
of interest. 

(xv) hybridization of the selected alleles forming mis-match free 
duplexes with the array of all possible alleles, or subset thereof, fixed to a 
solid support with known identity and spatial separation. This may be 
performed by addition of the VNTR alleles from individuals manifesting the 
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trait of interest, that have been selected by mis-match discrimination, to the 
array of fixed alleles and incubating at 98°C for 5 minutes followed by step- 
wise decrease in temperature to 78°C over several hours in a limited 
volume containing 0.2M NaCI held on the array under a coverslip. 
5 (xvi) addition of S1 nuclease and partial digestion of all mis-match 

containing duplexes, under appropriate conditions. 

(xvii) stringent washing of the array to remove strands that failed to 

hybridize to the array and digested duplexes. This may be achieved by 
washing, with SSC of low molar concentration at high temperature. 
10 (xviii) detection of those alleles that formed mis-match free 

duplexes with the array. This may be achieved by observation of 
fluorescence providing VNTR alleles fixed to the array are appropriately 
labelled. 

15 Example 6 

In this example the (CA) n /(GT) n VNTRs are generated from 
genomic DNA fragmented by a single restriction enzyme. The amplified 
products are hybridized to the genomic 'template' DNA of a single 
individual manifesting a particular trait of interest to yield the VNTR alleles 

20 represented in that genomic 'template' DNA. By hybridization of the 

amplified VNTR alleles to a sample of the same genomic 'template' DNA 
followed by partial digestion of any resulting mis-match containing 
duplexes and incubation of the products with biotin-dUTP, the spurious 
products of amplification due to reaction contamination and miss-priming 

25 events are removed. The non biotin containing alleles forming mis-match 
free duplexes are selected and hybridized to an array of VNTR alleles of 
known identity and spatial separation in which all possible alleles are 
represented, or a subset thereof. Providing VNTR alleles fixed to the array 
are appropriately labelled the identity of the alleles forming mis-match free 

30 duplexes with the array can be established. 
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(0 restriction digestion of genomic DNA using Rsa1 buffered 

appropriately at 37°C for several hours to yield genomic fragments. 
00 Annealing of the sense and antisense oligonucleotides that 

comprise the adaptor (a 48mer and 12mer respectively). This may be 
5 achieved by mixing equimolar quantities of each oligo in ligase buffer and 
heating to 50°C followed by gradual cooling to 10°C over 1 hour. 
0») Ligating the adaptor to the genomic fragments by addition of 

the fragments to the buffered adaptor and incubating at 15°C over night in 
the presence of T4 DNA ligase. 

io (iv) Removal of the ligase buffer and incubation of the ligated 

fragments with dideoxynucleotide triphosphates (ddNTPs) in the presence 
of taq DNA polymerase appropriately buffered at 72°C for 1 hour. 
Complete termination of the recessed 3* ends of the adaptor can be 
assessed by PCR of an aliquot of the reaction with adaptor primer which 

is should fail to yield any products of amplification. 

(v) Purification of the ligated fragments from the ddNTPs and 
amplification of those ligated fragments containing a (CA) n /(GT) n VNTR 
under appropriate conditions with either a (CA) n primer and the adaptor 
primer, or a (GT) n primer and the adaptor primer. Over amplification of 

20 products should be avoided to limit the opportunity for formation of 

concatamers. Using PCR a total of about 25 cycles may be appropriate, 
though NASBA or other methods may be employed to achieve the same 
goal. 

(vi) The (CA)n primers and the (GT) n primers are destroyed by 
25 physical separation of molecules based on molecular weight, 

electrophoretic mobility, or by digestion of single stranded DNA using an 
appropriate enzymes such as Exonclease 1 appropriately buffered at 37°C. 

(vii) hybridization of the (CA)n primed products to an equimolar 
quantity of the genomic 'template 1 DNA of the single individual under 

so investigation manifesting the particular trait of interest. Hybridization 
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should proceed under stringent conditions of high temperature and low salt 
concentration. Strand dissociation at 98°C for 5 minutes followed by a 
step-wise decrease in temperature to 78°C over several hours in solution 
of limited volume containing 0.2M NaCI may be appropriate. 

(viii) incubation of the hybridization components with taq DNA 
polymerase appropriately buffered at 72°C to achieve extension of 
annealed strands. 

(ix) addition of an amount of (GT) n primed products, similar to 
that of the (CA) n primed products, and adaptor primer to the reaction 
followed by amplification of products to yield VNTR alleles faithfully 
reproduced from the genomic template of the individual manifesting the 
trait of interest. This may be performed by PCR, NASBA or other methods. 

(x) addition of the VNTR alleles to a molar excess of the same 
genomic 'template' DNA from which the alleles were derived followed by 
dissociation of all strands by incubation at 98°C for 5 minutes then step- 
wise reduction in temperature to 78°C over several hours in a limited 
reaction volume containing 0.2M NaCI. This allows the alleles to hybridize 
to the appropriate locus on the genomic DNA. Heteroduplexes that may 
occur result from miss-priming events or reaction contamination. 

(xi) partial digestion of all mis-match containing duplexes using 
S1 nuclease under appropriate conditions. 

(xii) incubation of the reaction components with biotin-dUTP in the 
presence of Taq DNA polymerase appropriately buffered at 72°C for 1 
hour. 

(xiii) separation of the biotin-containing molecules by binding to 
streptavidin on a solid support. 

(xiv) repeated hybridization of the VNTR alleles to the same 
genomic •template* DNA followed by partial digestion of mis-match 
containing duplex molecules and incubation with biotin-dUTP for removal 
of all spurious products of amplification by binding to streptavidin on a solid 
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support. 

(xv) hybridization of the selected alleles with an array of all 
possible VNTR alleles that are fixed to a solid support with known identity 
and spatial separation This may be performed by addition of the selected 

5 VNTR alleles to the array of fixed alleles and incubating at 98°C for 5 
minutes followed by step-wise decrease in temperature to 78°C over 
several hours in a limited volume containing 0.2M NaCI held on the array 
under a coverslip. 

(xvi) addition of S1 nuclease and partial digestion of all mis-match 
10 containing duplexes, under appropriate conditions. 

(xvii) stringent washing of the array to remove strands that failed to 
hybridize to the array and digested duplexes. This may be achieved by 
washing with SSC of low molar concentration at high temperature. 

(xviii) detection of those alleles that formed mis-match free 
is duplexes with the array. This may be achieved by observation of 

fluorescence providing the VNTR alleles fixed to the array are appropriately 
labelled. 

Example 7 

2 ° in this example the (CA) n /(GT) n VNTRs are generated from 

genomic DNA fragmented by a single restriction enzyme. The amplified 
products are hybridized to the genomic template 1 DNA of a single 
individual manifesting a particulartrait of interest to yield the VNTR alleles 
represented in that genomic 'template' DNA. By strand dissociation and 

25 reannealing of the amplified VNTR alleles followed by partial digestion of 
any resulting mis-match containing duplexes and incubation of the 
products with biotin-dUTP, the spurious products of amplification due to 
reaction contamination and miss-priming events are removed. The non- 
biotin containing duplexes are selected and are hybridized to an array of all 

30 possible VNTR alleles, or subset thereof, that are fixed to a solid support of 
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known identity and spatial separation. Mis-match containing duplexes 
formed in hybridization to the array are partially digested. Providing VNTR 
alleles fixed to the array are appropriately labelled the identity of the alleles 
forming mis-match free duplexes with the array can be established; 
5 (>) restriction digestion of genomic DNA using Rsa1 buffered 

appropriately at 37°C for several hours to yield genomic fragments. 

(ii) Annealing of the sense and antisense oligonucleotides that 
comprise the adaptor (a 48mer and 12mer respectively). This may be 
achieved by mixing equimolar quantities of each oligo in ligase buffer and 

10 heating to 50°C followed by gradual cooling to 10°C over 1 hour. 

(iii) Ligating the adaptor to the genomic fragments by addition of 
the fragments to the buffered adaptor and incubating at 15°C over night in 
the presence of T4 DNA ligase. 

(iv) Removal of the ligase buffer and incubation of the ligated 

is fragments with dideoxynucleotide triphosphates (ddNTPs) in the presence 
of taq DNA polymerase appropriately buffered at 72°C for 1 hour. 
Complete termination of the recessed 3' ends of the adaptor can be 
assessed by PCR of an aliquot of the reaction with adaptor primer which 
should fail to yield any products of amplification. 

20 (v) Purification of the ligated fragments from the ddNTPs and 

amplification of those ligated fragments containing a (CA) n /(GT) n VNTR 
under appropriate conditions with either a (CA) n primer and the adaptor 
primer, or a (GT) n primer and the^adaptor primer. Over amplification of 
products should be avoided to limit the opportunity for formation of 

25 concatamers. Using PCR a total of about 25 cycles may be appropriate, 
though NASBA or other methods may be employed to achieve the same 
goal. 

(vi) The (CA) n primers and the (GT) n primers are destroyed by 

physical separation of molecules based on molecular weight, 
30 electrophoretic mobility, or by digestion of single stranded DNA using an 
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appropriate enzymes such as Exonclease 1 appropriately buffered at 37°C. 

(vii) hybridization of the (CA) n primed products to an equimolar 
quantity of the genomic 'template 1 DNA of the single individual under 
investigation manifesting the particular trait of interest. Hybridization 

5 should proceed under stringent conditions of high temperature and low salt 
concentration. Strand dissociation at 98°C for 5 minutes followed by a 
step-wise decrease in temperature to 78°C over several hours in solution 
of limited volume containing 0.2M NaCI may be appropriate. 

(viii) incubation of the hybridization components with taq DNA 
10 polymerase appropriately buffered at 72°C to achieve extension of 

annealed strands. 

(ix) addition of an amount of (GT) n primed products, similar to 
that of the (CA) n primed products, and adaptor primer to the reaction 
followed by amplification of products to yield VNTR alleles faithfully 

15 reproduced from the genomic template of the individual manifesting the 
trait of interest. This may be performed by PCR, NASBA or other methods. 

(x) strand dissociation and reannealing of the amplified VNTR 
alleles may be achieved by incubation at 98°C for 5 minutes then step-wise 
reduction in temperature to 78°C over several hours in a limited reaction 

20 volume containing 0.2M NaCI. Mis-match containing duplexes that may 
occur result from miss-priming events or reaction contamination. 

(xi) partial digestion of the mis-match containing duplexes using 
S1 nuclease under appropriate conditions. 

(xii) incubation of the reaction components with biotin-dUTP in the 
25 presence of Taq DNA polymerase appropriately buffered at 72°C for 1 

hour. 

(xiii) separation of the biotin-containing molecules by binding to 
streptavidin on a solid support. 

(xiv) repeated strand dissociation, reannealing, partial digestion 
30 and incubation with biotin-dUTP for removal of all spurious products of 
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amplification by binding of mis-matched duplexes to streptavidin on a solid 
support. 

(xv) hybridization of the selected mis-match free duplex forming 
alleles with an array of all possible VNTR alleles that are fixed to a solid 

5 support with known spatial separation This may be performed by addition 
of the selected VNTR alleles that have been selected by mis-match 
discrimination to the array of fixed alleles and incubating at 98°C for 5 
minutes followed by step-wise decrease in temperature to 78°C over 
several hours in a limited volume containing 0.2M NaCI held on the array 
10 under a coverslip. 

(xvi) addition of S1 nuclease and partial digestion of all mis-match 
containing duplexes, under appropriate conditions. 

(xvii) stringent washing of the array to remove strands that failed to 
hybridize to the array and digested duplexes. This may be achieved by 

is washing with SSC of low molar concentration at high temperature. 

(xviii) detection of those alleles that formed mis-match free 
duplexes with the array. This may be achieved by observation of 
fluorescence providing VNTR alleles fixed to the array are appropriately 
labelled. 

20 

Example 8 
Experimental Data 

1 . Preparatio n of amplimers using (CAV r ^and (GUV q primers 

25 (i) 2pg DNA was completely digested with 3pl Rsa I (lOu/pl; 

Promega) in a total volume of 100pl: 

8.5|jl genomic DNA (equivalent to 3pg DNA) 

10m! 10X reaction buffer 

3pl Rsa I (lOu/pl; Promega) 
30 78.5pl dH20 
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100pi 

The reaction was incubated at 37°C over night. 

The enzyme was heat inactivated by incubation at 70°C for 

20 minutes. 

The DNA was separated from the buffer using a 
microconcentrator (Microcon-100; Amicon). The volume of DNA recovered 
= 10|jl. 

(") 2nmoles 48mer and 2nmoles 12mer oligonucleotides that 

constitute the adaptor were combined; 

1 5.9pl 48mer (equivalent to 2 nmoles) 

13.7pl 12mer (equivalent to 2 nmoles) 

10pl 10X ligase buffer (NEB) 

48.4pl dH 2 Q 

88pl 

The mixture was heated to 50°C and allowed to cool to 10°C 

over 1 hour. 

To the 88pl of annealed adaptor was added the 1 Opl of 
digested DNA and ligation of the adaptor to the genomic fragments was 
performed: 

88pl annealed adaptor/ ligase buffer (containing ATP) 
10pl DNA 

2uLT4 DNA ligase (400 NEBu/pl) 
100pl 

The reaction was incubated at 16°C over night. 

The enzyme was heat inactivated by incubation at 70°C for 

20 minutes. 

The adaptor-ligated DNA fragments were separated from the 
buffer and non-ligated adaptor using a microconcentrator (Microcon-100; 
Amicon). The volume of DNA recovered = 12pl. 
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('») The adaptor-ligated DNA fragments were incubated with Taq 

DNA polymerase in the presence of dideoxynucleotide triphosphates to 
prevent 3" extension of the adaptor and non-ligated DNA in subsequent 
manipulations: 

1 2ul microconcentrated DNA 

3ul 10X NH4 reaction buffer 

1ul 50mM MgCl2 

lulddATP 

lulddCTP 

1 pi ddGTP 

iMlddTTP 

1 pi Taq DNA polymerase (5u/ul; Bioline) 

9yldH20 

30ul 

The reaction was incubated at 72°C for 2 hours. 

The adaptor-ligated DNA with terminated 3" ends was purified 
by phenol/chloroform extraction and microconcentration. 

The volume recovered was made up to 40ul and the 
concentration of DNA was gauged by gel quantification. The concentration 
of 75ng/ul was determined. 

(iv) (CA) primed amplimers and (GU) primed amplimers were 

generated in separate reactions: - 

10ul 10X NH4 reaction buffer 

8ul 50mM MgCl2 

1.5ul 10mM dNTPs 

1 pi adaptor-ligated DNA with terminated 3' ends 
4ul (CA) or (GU) primer (25pmol/ul) 
7JL5wldH20 
98pl 
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The reaction was overlaid with mineral oil. 

The reaction was heated to 95°C for 2 minutes, during which 
time Taq DNA polymerase (5u/pl; Bioiine) and 2pl adaptor primer 
(50pmol/pl) were added. 
5 Thermal cycling was performed as follows: 

95°C for 30 seconds, then 72°C for 45 seconds for a total of 
20 cycles, followed by 72°C for 5 minutes. 

To the 100pl of (CA) primed products was added 5pl 
Exonuclease I (10u/pl; Amersham Life Sciences) to remove the remaining 
10 (CA) primer. This reaction was incubated at 37°C for 30 minutes. 

To the 100pl of (GU) primed products was added 10pl uracil- 
DNA glycosylase (1u/pl; NEB) to digest all uracil incorporated into the PCR 
products. This reaction was incubated at 37°C for 2 hours. 1pl 10mM 
dNTPs was added followed by 2pl T4 DNA polymerase (5u/pl; Epicentre 
15 laboratories) to remove the protruding (CA) strand that complemented the 
digested (GU) sequence. This reaction was incubated at 37°C for 5 
minutes. 

Both the pools of amplimers were phenol/chloroform 
extracted and microconcentrated (Microcon-100; Amicon). For each pool, 
20 the volume recovered were made up to 500pl, of which 5pl was analysed 
by spectrophotometry to determine the concentration of DNA. 

2. Generation o f VNTRs from oehomic 'template' DNA 

Equal amounts of (CA) and (GU) primed amplimers were 
25 hybridized to genomic 'template' DNA of a single individual prior to thermal 
cycling. In order to gauge the optimal ratio of amplimer to genomic 
'template' DNA several reactions were performed using various amounts 
of 'template' DNA while keeping the amount of amplimers constant: 



30 
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, Template 1 DNA (ng) 
. Combined amplimers (ng) 
5M NaCI (pi) 
dH 2 0(pl) 

5 

Each reaction was overlaid with mineral oil- 
Each reaction was incubated at 98°C for 5 minutes, after 
which the temperature was reduced stepwise to 78°C over 4 hours. 
The following was added to each hybridization: 
io 5pl 1 0X NH4 reaction buffer 

4pl 50mM MgCl2 
0.75pl 10mM dNTPs 
0.5pl adaptor primer (50pmol/pl) 
34.2pl dH20 

15 Each reaction was spun briefly in a microfuge. 

Each reaction was heated to 72°C for 2 minutes and 0.5pl 
Taq DNA polymerase (5u/pl;BioIine) was added. The reactions were 
incubated at 72°C for a further 10 minutes, after which the temperature 
was raised to 95°C for 2 minutes. Thermal cycling was performed as 

20 follows; 

95°C for 30 seconds, then 72°C for 1 minute, for a total of 10 

cycles. 

For each reaction 10pl of products amplified for 10 cycles 
were added to 40pl of reaction mix and amplified under the same 
25 conditions for an additional 22 cycles. 5pl of the ends products of 

amplification were run on an agarose gel. The reaction containing 100ng 
genomic 'template* DNA was found to yield the most products of 
amplification, equivalent to a ratio of 100:1 by mass of genomic 'template* 
DNA: amplimer. 

30 The invention was validated by cloning the products of 



0 0.1 1 10 100 1000 
111111 
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To a final volume of 5.55pl 
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amplification. Two colonies of E.coli that had successfully transformed 
were cultured, from which plasmids were later harvested. These plasmids 
were sequenced and were found to contain VNTR sequences at the 
multiple cloning sites. 

5 
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CLAIMS 

1 ■ A method of making a mixture of VNTR alleles and their 

flanking regions of the genomic DNA of one or more members of a species 
of interest, which method comprises the steps of: 

a) dividing genomic DNA of the species of interest into 
fragments, 

b ) ligating to each end of each fragment an adaptor having a 
recessed 3'-end which is blocked to prevent enzymatic chain extension, 
thereby forming a mixture of adaptor-terminated fragments, 

c ) using a portion of the mixture of adaptor-terminated 
fragments as templates with an adaptor primer and a VNTR primer to 
create a mixture of 5-flanking VNTR amplimers, 

d ) using a portion of the mixture of adaptor-terminated 
fragments as templates with an adaptor primer and a VNTR antisense 
primer to create a mixture of 3'-flanking VNTR amplimers, 

e ) and using genomic DNA of the one or more members of the 
species of interest as template with the mixture of 5-flanking VNTR 
amplimers and the mixture of 3-fIanking VNTR amplimers as primers to 
make the desired mixture of VNTR alleles and their flanking regions. 

2 - A method as claimed in claim 1 wherein the strands of the 
mixture of VNTR alleles and their flanking regions are separated and then 
re-annealed and any mis-matches are separated and discarded. 

3 - A method as claimed in claim 1 or claim 2, wherein there is 
used in step e) genomic DNA of one or more members of the species of 
interest which manifest a trait of interest, whereby the resulting mixture of 
VNTR alleles and their flanking sequences is representative of those which 
manifest the trait of interest. 
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4. A method as claimed in claim 3, wherein the mixture of VNTR 

alleles and their flanking sequences representative of those which manifest 
the trait of interest is hybridized with a mixture of VNTR alleles and their 
flanking sequences representative of those which do not manifest the trait 
of interest, and mis-matches are selected to provide a mixture of VNTR 
alleles and their flanking sequences which are characteristic of the trait of 
interest. 

5 - A portion of genomic DNA of one or more members of a 

species of interest, said portion consisting essentially of a representative 
mixture of alleles of a chosen VNTR sequence and their flanking regions. 
6 * A portion as claimed in claim 5, wherein the mixture of alleles 

is representative of those which exhibit a trait of interest. 
7. A portion as claimed in claim 5 or claim 6, wherein each 

member of the mixture has an adaptor at its 3-end and its 5'-end. 

8 - A portion of genomic DNA of a species of interest, said 
portion consisting essentially of a representative mixture of 3-flanking 
regions of a chosen VNTR sequence, each member of the mixture carrying 
an adaptor at its 3*-end. 

9 - A portion of genomic DNA of a species of interest, said 
portion consisting essentially of a representative mixture of 5-flanking 
regions of a chosen VNTR sequence, each member of the mixture carrying 
an adaptor at its 5-end. 

1 0- A method of treating a mixture of polymorphic alleles, the 
mixture being representative of those which show a trait of interest, which 
method comprises separating and then re-annealing strands of the 
mixture, and separating and discarding any mis-matches. 

11- A method as claimed in claim 10, wherein the mixture of 
polymorphic alleles is a mixture of alleles of a chosen VNTR sequence and 
their flanking regions. 
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12 - A method as claimed in claim 10 or claim 11, comprising the 

additional step of hybridizing the said mixture with a mixture of 
corresponding polymorphic alleles which are representative of those which 
do not show the trait of interest, and selecting mis-matches to provide a 
5 mixture of polymorphic alleles which are characteristic of the trait of 
interest. 

13. A method as claimed in claims 1 1 and 12, wherein the 

mixture of corresponding polymorphic alleles is a mixture of alleles of the 
chosen VNTR sequence and their flanking regions, whereby there is 
10 obtained a mixture of VNTR alleles and their flanking sequences which are 
characteristic of the trait of interest. 

14 - A method of making a mixture of amplimers which method 

comprises the steps of: 

a ) dividing genomic DNA of one or more members of a species 
15 of interest into fragments, 

b ) ligating to each end of each fragment an adaptor having a 
recessed 3'-end which is blocked to prevent enzymatic chain extension, 
thereby forming a mixture of adaptor-terminated fragments, and 

c ) using a portion of the mixture of adaptor-terminated 

20 fragments as templates with an adaptor primer and a VNTR primer to 
create a mixture of 5-flanking VNTR amplimers, and/or 

d) using a portion of the mixture of adaptor-terminated 
fragments as templates with an adaptor primer and a VNTR antisense 
primer to create a mixture of 3'-fIanking VNTR amplimers. 

25 1 5 - A kit comprising protocols and reagents for performing the 

method of any one of claims 1 to 14. 
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ABSTRACT 

EXTRACTION A ND UTILISATION OF VNTR ALLELES 

The invention presented is a novel method for the extraction 
of VNTR alleles and for the concomitant detection of polymorphic markers 
for inherited traits at multiple loci by simultaneous comparison of complex 
genomes from multiple individuals. The product is designated a Total 
Representation of Alleles that are informative for a Irait (TRAIT). These 
alleles may be used directly as genetic markers or may be used as 
vehicles to facilitate precise localisation of sequence variations responsible 
for a hereditary trait all organisms. 



